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1 Introduction 



Given a fundamental theory describing a quantum many-body system, one often needs to obtain a 
concise description of its low-energy dynamics by integrating out high-energy degrees of freedom. 
The Schrieffer- Wolff (SW) method accomplishes this task by constructing a unitary transformation 
that decouples the high-energy and low-energy subspaces. 

In the present paper we view the SW method as a version of degenerate perturbation theory. It 
involves a Hamiltonian Hq describing an unperturbed system, a low-energy subspace Vq invariant 
under Hq, and a perturbation eV which does not preserve Vq. The goal is to construct an effective 
Hamiltonian H^s acting only on Vq such that the spectrum of H^q reproduces eigenvalues of the 
perturbed Hamiltonian H^+eV originating from the low-energy subspace Vq. The SW transformation 
is a unitary operator U such that the transformed Hamiltonian U{Hq + eV)U'^ preserves Vq. The 
desired effective Hamiltonian is then defined as the restriction of U {HQ+eV)U^ onto Vq. An important 
consideration arising in the context of many-body systems such as molecules or interacting quantum 
spins is the locality of -ffeff- In order for the effective Hamiltonian to be usable, it must only 
involve fc-body interactions for some small constant k. We demonstrate that this is indeed the case 
for the SW method by proving (under certain natural assumptions) that it obeys the so called linked 
cluster theorem. 

The purpose of the present paper is two- fold. First, we give a self-contained summary of the SW 
method with a focus on rigorous results. A distinct feature of our presentation is the use of both 
perturbative and exact treatments of the SW transformation. The former is given in terms of the 
Taylor series and the rules for computingthe Taylor coefficients while the latter involves the so called 
direct rotation between linear subspace^j. By combining these different perspectives we are able to 
obtain elementary proofs of several important properties of the effective Hamiltonian ifeff such as 
its additivity under a disjoint union of non-interacting systems. These properties do not manifest 
themselves on the level of individual terms in the perturbative expansion which makes their direct 
proof (by inspection of the Taylor coefficients of H^q) virtually impossible. In addition, our approach 
provides a systematic diagram technique for computing high-order corrections to which can be 
readily cast into a computer program for practical calculations. 

Our second goal is to specialize the SW method to interacting quantum spin systems. We assume 
that the unperturbed Hamiltonian Hq is a sum of single-spin operators such that the low-energy 
subspace Vo is a tensor product of single-spin low-energy subspaces. The perturbation V is chosen 
as a sum of two-spin interactions associated with edges of some fixed interaction graph. We prove 
that the Taylor series for ifeff obeys the so called linked cluster theorem, that is, the m-th order 
correction to i^es includes only interactions among subsets of spins spanned by connected clusters 
of at most m edges in the interaction graph (one should keep in mind that the spins acted on by 
ifeff are 'effective spins' described by the low-energy subspaces of the original spins), see Theorem [2] 
in Section 14.31 It demonstrates that the SW transformation maps a high-energy theory with local 

-^The direct rotation can be thought of as a square root of the double reflection operator used in the Grover search 
algorithm [T]. 
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interactions to an effective low-energy theory with (approximately) local interactions. 

For macroscopic spin systems the norm of the perturbation ||eV^|| is typically much larger than 
the spectral gap of Hq and one should expect level crossings to occur between energy levels from the 
low-energy and high-energy subspaces. On the other hand, one should expect that for sufficiently 
small |e| ground states of Hq + eV have most of their weight inside the low-energy subspace Vq. We 
prove that this is indeed the case for so-called block-diagonal perturbations V, that is, perturbations 
preserving Vq. Specifically, we prove that the low-energy subspace Vq contains at least one ground 
state of Hq + eV as long as |e| < eo for some constant eo that depends only on the spectral gap of 
Ho, the degree of the interaction graph, and the strength of spin-spin interactions, see Lemma [4.11 
in Section I4.2[ In that sense, the ground state of the system has a constant stability radius against 
block-diagonal perturbations. 

The main technical result reported in the paper is a rigorous upper bound on the error 6m up to 
which the ground state energy of i^eff! truncated at the rrith order, approximates the ground state 
energy of the exact Hamiltonian Hq + eV, see Theorem [1] in Section H75l Specifically, we prove that 
there exist constants c^, such that 6m < CmN\e\'^~^^ for all |e| < e^, where is the total number 
of spins. The coefficients Cm and depend only on the degree of the interaction graph, the strength 
of spin-spin interactions, the spectral gap A of the unperturbed Hamiltonian Hq, and the truncation 
order m. It shows that the approximation error 6m is extensive (linear in A^) for fixed m. On the 
other hand, 6m may grow very rapidly as a function of m. Note that our bound holds even in the 
region of parameters \\€V\\ ^ A where the perturbative series for H^s are expected to be divergent. 
The proof of the bound on 6m combines the SW transformation, the linked cluster theorem, and the 
ground state stability against block-diagonal perturbations. 

One undesirable feature of the SW transformation is that its generator S = log (U) is a highly 
non-local operator which cannot be written as a sum of local interactions even in the lowest order in 
e. The fact that locality of the theory does not manifest itself on the level of the generator S can be 
safely ignored if one is interested only in the final expression for -ffeff- However the non-local character 
of S makes it more difficult to prove bounds on the truncation error 6m- We resolve this problem by 
employing a different version of the SW method due to Datta et al [2] which we refer to as a local 
SW method. By analogy with the standard SW it defines a unitary transformation U generating an 
effective low-energy Hamiltonian acting on Vq. The advantage of the local SW transformation is that 
its generator S = log (U) has nice locality properties which makes analysis of the truncation error 
6m obtained using the local SW method much easier, see Theorem [3] in Section 14.41 Unfortunately, 
the local SW transformation does not exist outside the realm of perturbative expansions lacking 
an 'exact definition' similar to the standard SW. We establish equivalence between the local and 
the standard versions of the SW method by showing that the corresponding m-th order effective 
Hamiltonians can be mapped to each other by a unitary rotation of the low-energy subspace Vq up 
to an error 0{Ne"^~^^). 
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1.1 Applications of the SW method 

The SW transformation finds application in many contemporary problems in quantum physics, where 
an economical description of the low-energy dynamics of the system must be extracted from a full 
Hamiltonian. While SW is named in honor of the authors of an important paper in condensed matter 
theory [3], where the celebrated Kondo Hamiltonian was shown to be obtained by such a transfor- 
mation from the equally celebrated Anderson Hamiltonian, it is in fact not the earliest application of 
the technique. The concept of a most-economical rotation between subspaces has a several-centuries 
history in mathematics jHISj. Its direct descendent is the cosine-sine (CS) decomposition [6j, which 
is very closely related to the definition of the exact SW transformation that we adopt here. The 
earliest application of the canonical transformation in quantum physics is in fact at least 15 years 
earlier than [3], in the famous transformation of Foldy and Wouthuysen [7]. The authors of [7] used 
the technique to derive the non-relativistic Schroedinger-Pauli wave equation, with relativistic cor- 
rections, from the Dirac equation. In this case, the canonical transformation is one that decouples 
the positive and negative energy solutions of the relativistic wave equation. 

The applications of SW in modern times [5] are far too numerous even to allude to. The work of 
one of us uses SW routinely in many apphcations (see, e.g., Ref. [9]). Since part of our concern in the 
present work is a systematic development of a proper series expansion for the SW, we can mention 
that high-order terms of this series have been carefully and laboriously developed in some important 
works [lOj and may be found recorded in recent books dealing with applications [TT]. One can, 
however, find lengthy discussions of possible alternative approaches [121 [13]. It is widely acknowledged 
that SW is preferable to various other approaches, for example the Bloch expansion [HI [15], which, 
unlike SW, produces non-Hermitian effective Hamiltonians at sufficiently high order. SW is also 
superior to the self-consistent equation for the effective Hamiltonian arising from the diagrammatic 
self-energy technique [ISl [13 [IB] • 

It often occurs that the effective low-energy Hamiltonian has much higher level of complexity com- 
pared to the original high-energy Hamiltonian. This observation has lead to the idea of perturbation 
gadgets [m Uni [20] . Suppose one starts from a target Hamiltonian i^target chosen for some interesting 
ground state properties. The target Hamiltonian may possibly contain many-body interactions and 
might not be very realistic. Perturbation gadgets formalism allows one to construct a simpler high- 
energy simulator Hamiltonian with only two-body interactions whose low-energy properties (such as 
the ground state energy) approximate the ones of -^target- For example, various perturbation gadgets 
have been constructed for target Hamiltonians with the topological quantum order [211 [221 [23]. The 
SW method provides a natural framework for constructing and analyzing perturbation gadgets [2^ . 
In contrast to other methods, SW does not require the unphysical scaling of parameters in the simula- 
tor Hamiltonian required for convergence of the perturbative series [23] . This makes the SW method 
suitable for analysis of experimental implementations of perturbation gadgets with cold atoms in 
optical lattices, see [25} [26]. 



5 



1.2 Comparison between SW and other perturbative expansions 

In this section we briefly review some of the commonly used perturbative expansion techniques and 
argue that none of them can serve as a fully functional replacement of the SW method. 

We begin by emphasizing that the effective Hamiltonian i^eff is only defined up to a unitary 
rotation of the low-energy subspace Vq. Therefore one should expect that different versions of a 
degenerate perturbation theory produce different Taylor series for ifcfi- In particular, the error up 
to which the series for ifefr truncated at some finite order reproduce the exact low-energy spectrum 
may depend on the method of computing Hes- Different methods may also vary in the complexity 
of rules for computing the Taylor coefficients. 

The most convenient and commonly used perturbative method is the Feynman-Dyson diagram 
technique [16], [T71 127] . It can be used whenever the unpertubed ground state obeys Wick's theorem. 
Unfortunately, the standard derivation of the Feynman-Dyson expansion (see, e.g., Ref. [17]) which 
relies on the adiabatic switching of a perturbation can only be applied to non-degenerate ground 
states. It was recently shown explicitly that the Gell-Mann and Low theorem used in the derivation 
of Feynman-Dyson series fails for degenerate ground states [2H] • 

Exact quasi-adiabatic continuation [291 EH] provides an alternative path to defining i/es via a 
unitary transformation starting from a full Hamiltonian. This method however requires a constant 
lower bound on the energy gap of a perturbed Hamiltonian which is typically very hard to check. 

Traditional textbook treatment of a degenerate perturbation theory [211 [221 ESj is formulated in 
terms of perturbed and unperturbed resolvents 

G{z) = {zl -Ho- eV)-^ and G{z) = {zl - Hoy\ 

An effective low-energy Hamiltonian can be obtained from G{z) using the self-energy method [161 
[T71 [T5] . To sketch the method, we shall adopt the standard notations by writing any operator O as 
a block matrix 

0+ 



O 



where the two blocks correspond to the low-energy and the high-energy subspaces respectively. One 
can define an effective low-energy resolvent as G-{z), that is, the low-energy block of G{z). Its 
importance comes from the fact that for sufficiently small e eigenvalues of Hq + eV originating from 
the low-energy subspace coincide with the poles of G_ (z) , see [18] . The poles of G_ (z) can be found 
using the Dyson equation [T7| 

G4z) = G_iz) + G4z)^iz)G_iz), 

where 

oo 

S(z) = eV^ + J2 e'^" V^+{G+{z)V+rG+{z)V+^ 

n=0 

is the so-called self-energy operator acting on the low-energy subspace. The Dyson equation yields 



G4z)-^ = G4z)-^ - = zl- (ifo)~ - S( 



z 
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Hence z is an eigenvalue of i/o + originating from the low-energy subspace (the pole of G'_(2;)) 
iff z is an eigenvalue of (-ffo)- + ^(-2)- In that sense one can regard Hes{z) = (-ffo)- + ^(2;) as a 
z-dependent effective Hamiltonian on the low-energy subspace. Although in many important cases 
the z-dependence of Hes{z) can be neglected [IB], there is no systematic way of getting rid of this 
dependence. This is the main disadvantage of the self-energy formalism compared with the SW 
method. 

Another commonly used version of the degenerate perturbation theory is the expansion due to 
Bloch |U|32l[T5], see [201 122] ior some recent applications. The Bloch expansion is conceptually very 
similar to the SW method. It provides a systematic way of constructing an "effective Hamiltonian" 
acting on the low-energy subspace, but this Hamiltonian is generally not hermitian and as such it 
does not describe any physical theory. To make this point more clear let us recap the main steps of 
the Bloch expansion, see for instance [151 ISO]- Let {\'ipj)}j be the eigenvectors Hq + eV originating 
from the low-energy subspace and {\j}j be the corresponding eigenvalues. Decompose 

\^pj) = \aj) + \aj~), where \aj) E Vq and \af) G Vq. 

Define an operator U such that U \aj) = \ipj) and U l^p) = for all ip G Vq. The operator U 
is analogous to the (inverse) SW transformation, although it is not unitary. Using the notations 
Eq. (11. ip one can write U as 



It shows that W is a (non-hermitian) projector onto the perturbed low-energy subspace. Define the 
effective Hamiltonian acting on Vq as 

H,s = Po{Ho + eV)U, 

where Pq is the projector onto Vo- It follows that 

ifeff = Po{Ho + eV) |^,) = A,- Pq iV-j) = A, 

Thus one can regard Heg as a counterpart of the effective Hamiltonian in the SW theory — it acts 
only on the unperturbed low-energy subspace and its spectrum reproduces eigenvalues of Hq + eV 
originating from the low-energy subspace. However, since \aj) do not constitute an orthonormal basis 
of Vo, the operator H^g is not hermitian, and thus it cannot be regarded as a physical Hamiltonian. 

The advantage of the Bloch expansion is that the perturbative series for Hes has somewhat 
simpler structure compared with the corresponding expansion in the SW method. Indeed, using the 
fact that W is a projector one easily gets 

[Ho + eV,U]U = 0. (1.2) 

Using the identities PqU = Pq and UPq =U one can rewrite Eq. (11.21) as 

[Ho,U] = -e[V,U]U = 0. (1.3) 

Expanding U = Yl'^=o ^n^^ with Uq = Pq and using Eq. (I1.3P one can express W„ as a second-degree 
polynomial in Wi, . . . ,Un-i which provides a recursive rule for computing W„. This leads to a simple 
diagram technique for computing the Taylor coefficients of i^es, see [15i |20j for details. 
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1.3 Organization of the paper 

To be self-contained, the paper repeats some well-known results such as the low-order terms in the 
SW series. However we present these results in a very systematic and compact form that we believe 
might be useful for other workers. 

Sections provide an elementary introduction to the SW method for finite-dimensional Hilbert 
spaces. Section [2] summarizes the definition and basic facts concerning the direct rotation between 
linear subspaces. The direct rotation was originally introduced in the context of perturbation theory 
by Davis and Kahan in [4]. Although we hardly discover any new properties of the direct rota- 
tion, some of the basic facts such as the behavior of the direct rotation under tensor products (see 
Section 12. 4p might not be very well-known. 

Section |3] defines the SW transformation as the direct rotation from the low-energy subspace of 
a perturbed Hamiltonian Hq + eV to the low-energy subspace of Hq. This definition provides an 
elementary proof of additivity of H^q. Namely, given a bipartite system AB with no interactions 
between A and B, the effective Hamiltonians Heg[AB], Hes[A], and HeG[B] describing the joint 
system and the individual systems respectively are related as 

H,s[AB] = H,s[A] ®Ib + Ia^ H,s[B]. 

Here Ia, Ib are the identity operators acting on the low-energy subspaces of A and B. We note 
that the above additivity does not hold on the entire Hilbert space as one could naively expect. In 
Section 13.21 we compute the Taylor series for the SW transformation and Hcs- Our presentation 
uses the formalism of superoperators to develop the series. It allows us to keep track of cancelations 
between different terms in a systematic way and obtain more compact expressions for the Taylor 
coefficients. A convenient diagram technique for computing i^es is presented in Section 13.31 Con- 
vergence of the series is analyzed in Section 13.41 To the best of our knowledge, the approach taken 
in Sections I3.2f3.3f3.4l is new. Our main technical contributions are presented in Section H] that 
specializes the SW method to weakly interacting spin systems. 
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2 Direct rotation between a pair of subspaces 



The purpose of this section is to define a direct rotation between a pair of hnear subspaces and derive 
its basic properties such as a multiphcativity under tensor products. 

2.1 Rotation of one-dimensional subspaces 

Let l-L be any finite-dimensional Hilbert space. For any normahzed state & T-L define a refiection 
operator i?^ that fiips the sign of and acts trivially on the orthogonal complement of ip, that is, 

R^ = I-m{i;\. (2.1) 

Given a pair of non-orthogonal states ^Ti we would like to define a canonical unitary operator 
mapping ip to (p up to an overall phase. Let us first consider the double refiection operator 
R^R^. It rotates the two-dimensional subspace spanned by ip and (p by 26*, where 6 G [0, 7r/2) is the 
angle between ip and (p. In addition, R^R^ acts as the identity in the orthogonal complement to ip 
and (p. Thus we can choose the desired unitary operator mapping from ip to (p as U^^^ = ^JR^R^ 
assuming that the square root is well-defined. 

Definition 2.1. Let ip,(p be any non-orthogonal states. Define a direct rotation from ip to (p as 
a unitary operator 

U^^tj, = \/ R<t>Rip (2-2) 
Here y/z is defined on a complex plane with a branch cut along the negative real axis such that \/l = 1 . 

It is worth pointing out that any unitary operator is normal and thus the square root ^JR^R^ is well 
defined provided that R^R^ has no eigenvalues lying on the chosen branch cut of the ^/z function. 
The following lemma shows that U^^^f, is well defined and performs the desired transformation. 

Lemma 2.1. Let ip,(p eH be any non- orthogonal states. Then the double reflection operator R^R^ 
has no eigenvalues on the negative real axis. Furthermore, fix the relative phase of ip and cp such that 
{iplcp) is real and positive. Then U^^^ \ip) = \(p). 

Proof. Since t/^^^ acts as the identity on the subspace orthogonal to ip and 0, it suffices to consider 
the case Ti = C^. Without loss of generality {ip) = |1) and \(p) = sin (6') |0) + cos (6') where, by 
assumption, < 9 < tt/2. Using the definition Eq. (12.11) one gets 

R^ = a' and R^ = cos (29) a' - sin {29) a^. 

It yields 

R^R^ = cos (2^) I + isin (29) = exp (2z^a^). 
Since < 20 < vr, no eigenvalue of R^R^ lies on the negative real axis and thus 

U^^^ = ^/R^R^ = exp (19(7^) 

is uniquely defined. We get U^^^ |1) = sin (9) |0) + cos (9) |1), that is, U^^ff, \ip) = \4>)- □ 
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Later on we shall need the following property of the direct rotation. 



Corollary 2.1. Let ip^ip ^l-i he any non- orthogonal states, P = and Pq = The direct 

rotation from ip to cf) can he written as U^^^ = exp (S) where S is an anti-hermitian operator with 
the following properties: 

• PSP = PoSPo = {I- P)S{I - P) = (J - Po)S(J - Fo) = 0, 

• ll^ll < V2. 

Proof. Indeed, we can assume that {ip) = |1) and |0) = sin (6*) |0) + cos {6) |1) for some 6 G [0, 7r/2). 
Choose S = iOa^ and = in the orthogonal complement to ip and 0. By assumption, \\S\\ = 
\9\ < 7r/2. A simple algebra yields (iplSlip) = {4>\S\(j)) = 0. Restricting all operators on the two- 
dimensional subspace spanned by ip and one gets (/ — P)S{I — P) = {Tp-^\S\tl)-^) and (/ — Po)S{I — 
Po) = {4>'^\S\4>'^), where \ip-^) = |0) and 10-*-) = cos(6')|0) — sin(^^)|l). A simple algebra yields 

(V^^|S|V^^) = (0^1^10^) = 0. □ 



2.2 Rotation of arbitrary subspaces 

Let V,Vo C H be a pair of linear subspaces of the same dimension and P, Pq be the orthogonal 
projectors onto V, Vq. We would like to define a canonical unitary operator U mapping VtoVo. By 
analogy with the non-orthogonality constraint used in the one- dimensional case we shall impose a 
constraint 

IIP- Poll <1. (2.3) 
The meaning of this constraint is clarified by the following simple fact. 

Proposition 2.1. Condition ||P — Po|| < 1 holds iff no vector in V is orthogonal to Vq and vice 
verse. In particular, ||P — Po|| < 1 implies that dimP = dimPo- 

Proof. For any choice of the projectors P, Pq the spectrum of P — Pq lies on the interval [—1, 1] which 
implies ||P — Po|| < 1. Suppose ||P — Po|| = 1. Then there exists ip such that (P — Pq) = ^\ip)- 
This is possible only if ('0|P|'0) = 1, {ilj\Po\ip) = or vice verse. Thus ip E V and ip G V^^, or vice 
verse. Conversely, if such a vector tp exists, then (P — Pq) |-0) = ±1^/^), that is, ||P — Po|| = 1. By the 
same token, if we assume that dim V > dim Vq (or vice verse) there must exist a vector %p eV that 
is orthogonal to Vq (or vice verse), that is, ||P — Po|| = 1. □ 

For any linear subspace V define a reflection operator R-p that flips the sign of all vectors in V 
and acts trivially on the orthogonal complement to P, that is, 

Rv = 2P- L (2.4) 

Following |4j let us define a direct rotation between a pair of subspaces as follows. 
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Definition 2.2. Let ^/z be the square-root function defined on a complex plane with a branch cut 
along the negative real axis and such that s/T = 1. A unitary operator 



U = a/ RvoR- 



■V 



(2.5) 



is called a direct rotation from V to Vq- 

Lemma 2.2. Suppose \\P — Poll < 1- Then no eigenvalue of R-p^R-p lies on the negative real axis, 
so that U is uniquely defined by Eq. Ii2.3^) and 



It is worth pointing out tliat tlie direct rotation U can also be defined as the 'minimal' rotation 
that maps V ioV^. More specifically, among all unitary operators V satisfying VPV'^ = Pq the direct 
rotation differs least from the identity in the Frobenius norm, see 0]. If, in addition, ||P-Po|| < VS/2, 
the Frobenius norm can be replaced by the operator norm [3]. 

Proof of Lemma 12.21 It is well-known that any pair of projectors can be simultaneously block- 
diagonalized with blocks of size 2x2 and 1x1, and such that all 2 x 2 blocks are rank-one projectors, 
see Chapter 7.1 of [S]. It follows that all operators P, Pq, Pp, Ppg, ?7 can be simultaneously block- 
diagonalized with blocks of size 2x2 and 1x1. 

Let us first consider some 1x1 block. The restriction of P and Pq on this block are scalars. The 
condition ||P — Po|| < 1 implies that P = Pq = or P = Pq = 1. Thus R-p = R-p^ = ±1. In both 
case Rpf^Rp = 1 and thus U = 1. Thus the identity UPW = Pq holds for any 1x1 block. 

Let us now consider some 2x2 block. In this block one has P = and Pq = \4>){4>\ for some 

one-qubit states ip,(f) & C^. The condition ||P — Po|| < 1 implies that and are non-orthogonal. 
In addition, the restriction of U onto the considered block coincides with the direct rotation U^^^, 
see Eq. (12. 2p . Therefore Lemma [2.11 implies that UPW = Pq in any 2x2 block. □ 

For the later use let us point out one extra property of the direct rotation which follows directly 
from the above proof and Corollary 12.11 

Corollary 2.2. The direct rotation from V to Vq can be written as U = exp (S) where S is an 
anti-hermitian operator with the following properties: 



UPU^ = Pr 



(2.6) 



• PSP = PoSPo = (/ - P)S{I -P) = {I- Po)5(/ - Po) = 



Proof. Indeed, apply Corollary 12. II to each block in the decomposition of P, Pq, and U. 



□ 
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2.3 Generator of the direct rotation 

The exphcit formula Eq. (12. 5p for the direct rotation U is not very useful if one needs to compute U 
perturbatively. In this section we describe an alternative definition of the direct rotation in terms of 
its generator, that is, an antihermitian operator S such that U = exp (S). We shall see later that the 
Taylor coefficients in the perturbative series for S can be computed using a simple inductive formula. 

Lemma 2.3. Suppose ||P — Poll < 1- Then there exists a unique anti-hermitian operator S with the 
following properties: 

(1) expiS)Pexpi-S) = Po, 

(2) S is block- off- diagonal with respect to Pq, that is, 

PoSPo = and (/ - Po)S{I - Pq) = 0. (2.7) 

(3) \\S\\ < 7r/2. 

The unitary operator U = exp (S) coincides with the direct rotation from V to Vq. 

Proof. We already know that there exists at least one operator S with the desired properties, see 
Corollary 12.21 Let us show that it is unique. Indeed, write S as 

where the first and the second block correspond to the subspaces Vq and Vq respectively. Computing 
the exponent exp {S) yields 



where 

f/i,i = cos (A), A= \fSi^2S]^2 aiid f/2,2 = COS (P), B = \fsl^2^2- 
Note that ||A|| = ||i?|| = \\S\\ < 7i/2. Hence Ui^i and 1/2,2 sxe hermitian positive-definite operators, 

f/1,1 > and t/2,2 > 0. (2.9) 

Let S be any anti-hermitian operator satisfying the three conditions of the lemma and let U = exp {S). 
We have to prove that U = U and S = S. Indeed, using the identity 

{UU^)PoiUU^y = UPU^ = Po 

we conclude that UW commutes with Pq. This is possible only if UU"^ = L for some block- diagonal 
unitary L, that is, 

L=(\' I), L,L\ = I, LlL2 = I. 



2 



It follows that U = LU, that is f/i^i = Lif/i 1 and 1/2,2 = -^2^^2,2- Combining it with Eq. (12.91) we 
arrive at Li = / and L2 = / as follows from the following proposition. 
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Proposition 2.2. Let H be a positive operator and L be a unitary operator. Suppose that LH is 
also a positive operator. Then L = I. 

Proof. Indeed, consider the eigenvalue decomposition L = Since LH > one gets 

{a\LH\a) = e*^"(a|if|a) > 0. Since {a\H\a) > we conclude that e*"" > 0, that is, e'"" = 1. Hence 
L = L □ 

To summarize, we have shown that U = U . Since the matrix exponential function exp (M) is 
invertible on the subset of matrices satisfying ||M|| < 7r/2, we conclude that S = S. □ 



2.4 Weak multiplicativity of the direct rotation 

Consider a bipartite system of Alice and Bob with a Hilbert space "H = Ti^ ® "H^. Let P^, P^^ be 
a pair of Alice's projectors acting on Ti^. Similarly, let P^, Pq be a pair of Bob's projectors acting 
on TL^ . We shall assume that 

II - Pq'^II < 1 and II - P(f II < 1 (2.10) 
such that one can define local direct rotations and , 

Consider also the global direct rotation U^^ mapping P^ (g) P^ to P^^ (g> Pq^, that is, 

[/^^(P^ ® P^)(f/^^)t = Po^ ® Po^. 

It is worth pointing out that in general U^^ ^ U^®U^ . Indeed, consider a special case when P^ and 
P^ project onto one-dimensional subspaces of Alice and Bob. Then clearly P"^ ® P^ is a rank-one 
projector. In this case U"^^ is a rotation in some two-dimensional subspace of "H, see Section 12.11 
On the other hand ® is a tensor product of a rotation in some two-dimensional subspace of 
'H'^ and a rotation in some two-dimensional subspace of l-i^ . Therefore ® acts non-trivially 
on some four-dimensional subspace of "H and hence U^^ ^ U"^ ® . This example shows that the 
direct rotation is not multiplicative under the tensor product. Nevertheless, it features a certain 
weaker form of multiplicativity which we derive in this section. 

To start with, we need to check that the direct rotation U"^^ is well-defined. It follows from the 
following proposition. 

Proposition 2.3. Suppose Alice's projectors P^^Pq and Bob's projectors P^^Pq satisfy non- 
orthogonality constraints Eq. 112. 10\) . Then 

||P^(g)P^-P(f ®Po^|| < 1. (2.11) 
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Proof. Let us assume that Eq. (12.111) is false. Then there exists a state ip E Ti such that P^®P^ = 
and P^ ® Pq^ |^) = {ip) (or vice verse). Below we show that it leads to a contradiction. Indeed, 
the condition \\P^ — Pff^W < 1 implies that there exists a > such that P^ — P^^ < (1 — a) I. 
Multiplying this inequality on both sides by P^ we arrive at P^ — P^P^^P^ < (1 — a)P^, that is, 
pApApA > ^pA_ Similarly, there exists /3 > such that P^P^P^ > (3P^. Thus 

(P^ ® P'')(Po^ ® Po){P^ ® P"") = (P^P^P^) 8) {P^'P^P'') > al3P^ ® P^. (2.12) 

Here we used the fact that a tensor product of positive semi-definite operators is a positive semi- 
definite operator. Computing the expectation value of Eq. (I2.12p on one gets > a/? which is a 
contradiction. □ 

We conclude that the global direct rotation U"^^ mapping P^ ® P^ to Pq ® P^ is well-defined. 
The relationship between the global and the local rotations which we shall call a weak multiplicativity 
is established by the following lemma. 

Lemma 2.4 (Weak multiplicativity). Let U^, , and U^^ he the direct rotations defined above. 
Then 

U^^ (P^ ® P^) = {U^ ® t/^)(P^ ® P^). (2.13) 

Proof. Let us perform the simultaneous block-diagonalization of P"^, Pf^ and P^ , Pq with blocks of 
size 2x2 and 1 x 1, as in the proof of Lemma [2^ Recall that each 2x2 block is a rank-one projector. 
Hence it suffices to check Eq. (12.131) only for the case when "H"^ = T-L^ = and rank-one projectors 
pA pB pA pB QJioose normalized states ip^, ip^, i/jq, i/jq in the range of the above projectors, 
and fix their relative phase such that (ip'^l'ipo) > and {ip^\ipo) > 0. Lemma l2.ll implies that 
\^jJ'^) = \-ipQ) and = \ipo)- However, since {'iIj^®iI)^\'4)q®iPq) > 0, Lemma [2?T] also implies 

that U^^ \^p^0^|J^) = 1?/'^®^^). This we have an identity U^^{P^(g)P^) = {U^®U^){P^(g)P^) in 
each tensor product of 2 x 2 blocks. Similar arguments hold if one or both blocks have size 1x1. □ 

3 Effective low-energy Hamiltonian 

3.1 SchriefFer- Wolff transformation 

Let "H be a finite-dimensional Hilbert space and Hq be a hermitian operator on "H. We shall refer to 
Hq as an unperturbed Hamiltonian. Let Xq C M be any interval containing one or several eigenvalues 
of Hq and let Pq ^ ^ be the subspace spanned by all eigenvectors of Hq with eigenvalue lying in Xq. 
We shall say that Hq has a spectral gap A iff for any pair of eigenvalues A, rj such that A G Xq and 
rj ^ Iq one has |A — ?7| > A. In other words, the eigenvalues of Hq lying in Xq must be separated 
from the rest of the spectrum by a gap at least A. 

Consider now a perturbed Hamiltonian H = Hq + eV, where the perturbation V is an arbitrary 
hermitian operator on H. Let X C M be the interval obtained from Iq by adding margins of thickness 
A/2 on the left and on the right of Xq. Let P C H be the subspace spanned by all eigenvectors of 
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H with eigenvalue lying in X. In this section we shall only consider sufficiently weak perturbations 
that do no close the gap separating the interval Xq from the rest of the spectrum. Specifically, we 
shall assume that |e| < ec, where 

Since the perturbation shifts any eigenvalue at most by ||e\^||, we conclude that the eigenvalues of 
H lying in X are separated from the rest of the spectrum of if by a positive gap as long as |e| < ec- 
In particular, it implies that Vq and V have the same dimension. Let Pq and P be the orthogonal 
projectors onto Vq and V. Introduce also projectors Qo = I — Pq and Q = I — P. 

Lemma 3.1. Suppose e is real and |e| < Cc. Then \\Pq — P\\ < 2||eV||/A < 1. 

Proof. Let T = P — Pq. A simple algebra shows that is block-diagonal with respect to Pq and 
Qo, that is PqT'^Qq = and QqT'^Pq = 0. The remaining diagonal blocks of are PqT'^Pq = PqQPq 
and QqT'^Qo = QqPQo- Since T is hermitian, we have ||T|| = a/HT^H and thus 

||T|| = max {y/\\PoQPQ\\, ^/IQ^PQ4) = max (||PoQ||, \\QoP\\). 

Let us assume that ||T|| = ||-PoQ|| (the case ||T|| = ||PQo|| is dealt with analogouslj|§) . Define 
auxiliary operators A = PqHqPq, B = QHQ, and X = PqQ. Then one has 

AX -XB = Pq{Hq - H)Q = -ePoVQ = Y. (3.2) 

The equation AX — XB = F is known as the Sylvester equation. In particular, it is known that 
||Ar|| < whenever spectrums of A and B can be separated by an annulus of width 5 in 

the complex plane, see for instance. Theorem VII. 2. 11 in f6]. In our case the spectrums of A and B 
considered as operators on Pq and Q respectively are separated by an annulus of width A/2 whenever 
|e| < ec- Thus we arrive at 

||X|| < (2/A) < (2/A) ||eF|| < 1 
as long as |e| < ec- □ 

Lemma [3. II implies that the direct rotation U from V to Vq is well-defined as long as |e| < ec- By 
definition, it satisfies UPW = Pq and UQW = Qq- In addition, since the perturbed Hamiltonian 
H = Hq + eV is block-diagonal with respect to P and Q, we conclude that the transformed Hamilto- 
nian UHW is block-diagonal with respect to Pq and Qq- In almost all applications of perturbation 
theory the interval Iq includes only the smallest eigenvalue of Hq or all sufficiently small eigenval- 
ues. By a slight abuse of notations we shall adopt the standard terminology and refer to Vq as the 
low-energy subspace of Hq. However, one should keep in mind that all results of this section remain 
valid for an arbitrary choice of the interval Xq. 

Definition 3.1. The direct rotation U from V to Vq is called the Schrieffer- Wolff (SW) transfor- 
mation for the unperturbed Hamiltonian Hq, perturbation eV , and the low-energy subspace Vq. The 
operator H^s = PqU{Hq + eV)WPQ is called an effective low-energy Hamiltonian. 



^It is worth mentioning that \\P — Po\\ = \\PoQ\\ = \\QoP\\ for any projectors P, Pq satisfying \\P — Poll < 1- These 
identities can be easily proved using the simultaneous block-diagonalized form of P and Pq. 
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3.2 Derivation of the perturbative series 

Let U be the Schrieffer- Wolff transformation constructed for some unperturbed Hamiltonan Hq, a 
perturbation eV, and tlie low-energy subspace Vq. In tliis section we always assume that |e| < ec, 
so U is well-defined, see Lemma 13.11 In most of applications the explicit formula Eq. (12 .Sp for 
the SW transformation cannot be used directly because the low-energy subspace V of the perturbed 
Hamiltonian Ho + eV is unknown. In this section we explain how one can compute the transformation 
U and the effective low-energy Hamiltonian Hcs perturbatively. Truncating the series for Hes at some 
finite order one obtains an effective low-energy theory describing properties of the perturbed system. 
Throughout this section all series are treated as formal series. Their convergence will be proved later 
in Sectioning 

We begin by introducing some notations. The space of linear operators acting on "H will be 
denoted L('H). A linear map O : L('H) — i- L('H) will be referred to as a superoperator. We shall 
often use superoperators 

0{X) = PoXQo + QoXPo and V{X) = PqXPo + QqXQo. 

We shall say that an operator X is block- off- diagonal iff 0{X) = X. An operator X is called 
block- diagonal iff V^X) = X. Decompose the perturbation as = Vd + Kd, where 

= V{V) and Vod = 0{V) 

are block-diagonal and block-off-diagonal parts of V. Given any operator Y G L('H) define a super- 
operator Y describing the adjoint action of Y, that is, Y{X) = [Y,X]. 

Recall that the SW transformation can be uniquely represented as U = exp [S) where S is an 
anti-hermitian generator which is block-off-diagonal and \\S\\ < 7r/2, see Lemma 12.31 whereas the 
transformed Hamiltonian e^{HQ-\-eV)e~^ is block-diagonal. Combining these conditions would yield 
Taylor series for S, which, in turn, yields Taylor series for ifes- We begin by rewriting the transformed 
Hamiltonian as 

exp (5) {Ho + eV) = cosh (5) {Ho + eV^) + sinh {S) (eKd) + sinh (5) {Ho + eVd) + cosh (5) (eKd) • (3.3) 

Taking into account that 5* is block-off-diagonal, we conclude that the first and the second terms in 
the righthand side of Eq. (13. 3p are block- diagonal, while the third and the fourth terms are block- 
off-diagonal. In order for the transformed Hamiltonian to be block-diagonal, S must obey 

sinh {S){Ho + eVd) + cosh (^)(eKd) = 0. 

Since our goal is to derive formal Taylor series, S can be regarded as an infinitesimally small operator. 
In this case the superoperator cosh {S) is invertible and thus the above condition can be rewritten as 

tanh {S){Ho + eV^) + eKd = 0. (3.4) 
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Now we can rewrite the transformed Hamiltonian as 

exp{S){Ho + eV) = cosh (5)(ifo + e^d) + sinh (5)(eKd) 

= Ho + eVd + (cosh (S) -l){Ho + eV^) + sinh (5)(eKd) 

= Ho + eV^ + tanh {S){Ho + eV^) + sinh (5)(eKd). 

tanh [S) 

Note that (cosh (5) — 1)/ tanh {S) is well defined for infinitesimally small S by its Taylor series. Using 
Eq. (13 ■4p we arrive at 

exp {S)iHo + eV) = Ho + eV^ + F(5)(eKd), 

where 

cosh (x) - 1 

r (x) = smh (x) — . 

^ ^ ^ ^ tanh(x) 

A simple algebra shows that F{x) = tanh {x/2), so we finally get 

exp {S){Ho + eV) = Hq + eV^ + tanh (5/2) (e^d). (3.5) 

In order to solve Eq. fl3.4p for 5* let us introduce some more notations. Let be an orthonormal 
eigenbasis of Hq such that Hq \i) = Ei \i) for all i. We shall use notation i G Xq as a shorthand for 
Ei G Xq. Define a superoperator 

Ei — Ei 



,(X).^M>|.)01. (3.6, 



Note that {i\0{X)\j) = whenever i,j G Xq or i,j ^ Xq. Let us agree that the sum in Eq. (13.61) 
includes only the terms with i G Xq, j ^ Xq or vice verse. In this case the energy denominator can 
be bounded as \Ei — Ej\ > A. One can easily check that 

C{[Ho,X]) = [Ho,CiX)] = OiX) (3.7) 

for any operator X G L(7{). It is worth mentioning that C maps hermitian operators to anti- 
hermitian operators and vice verse. 

Treating S as an infinitesimally small operator we can rewrite Eq. (13.41) as 

S{Ho + eVd) + Scoth (5)(eKd) = 0. 

Using Eq. (13.71) and the fact that 5* is block-off-diagonal one gets 

S = CSieVd) + CScoth{S){eVod). (3.8) 

We can solve Eq. (13.81) in terms of Taylor series, 

oo 

^ = 5^S„e", Si = -Sn. (3.9) 

n=l 
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Let us also agree that Sq = 0. We shall need the Taylor series 



xcothfx) = > a2„a;^", = r—, (3.10) 

ml 

n=0 



where Bm are the Bernoulli numbers. Then we have 



^1 = CiVod), 

Sn = -£Ki(S„_i) + 5^a2,/:^'^(Kd)n-i forn>3. (3.11) 



Here we used a shorthand 



S'{Kd)m= Yl '5„,i---S'„,(Kd). (3.12) 

ni,..., nfc>l 
?ii+...+?ij.=m 

Note that the righthand side of Eq. (13. lip depends only on S'l, . . . , Sn-i- Hence Eq. (13.111) provides 
an inductive rule for computing the Taylor coefficients S'„. Projecting Eq. ( 13. Sp onto the low-energy 
subspace one gets 

oo 

= HoPo + ePoVPo + ^ e'^i/efr.n, i^eff,n = ^2.-1^0 S^'-\VoA)n-iPo. (3-13) 

n=2 j>l 

Here &2i-i are the Taylor coefficients of the function tanh(x/2). More explicitly, 

tanh(x/2) = f;62„-,x2^-\ &2n^i = (3.14) 



n=l 



Note that -ffeff.n depends only upon 5*1, ... , 5',„_i. To illustrate the method let us compute the Taylor 
coefficients -ffeff.n for small values of n (a systematic way of computing H^^ n in terms of diagrams is 
described in the next section). From Eq. (13. lip one easily finds 

S, = -CV^{Ss) + a2CiS,S2 + S2S,)iV,^). 

From Eq. (I3.13p one gets 

H,s,2 = &iPo^i(Kd)Po, (3.15) 

i^cfr,3 = hPoS2iVod)Po, (3.16) 

HeSA = hPoSsiVo^)Po + hPoSUVod)Po, (3.17) 

Hes,5 = hPoS,iV,^)Po + hPoiS2S! + S,S2Si + S!S2)iVo^)Po. (3.18) 
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Figure 1: The value of the Taylor coefficients a2n and 62n-i for small n. 



Let us obtain explicit formulas for ifcfi.a and i/eff,4 that include only Si. Using the identity 5*2 (Kd) = 
-Vod{S2) we get 

i^efr,3 = hPoVodCVa{Si)Po (3.19) 

and 

Substituting 5*3 into the expression for H^s^i and using the identity S^iVod) = —Vod{Ss) one gets 



eSA 



(3.20) 



The explicit values of the Taylor coefficients a2n and 62n-i are listed in Table [T] The formula for 
ifeff,4 can be slightly simplified in the special case when Xq contains a single eigenvalue of Hq, that 
is, the restriction of Hq onto Pq is proportional to the identity operator. In this special case one can 
use an identity 

PolCiX), OiY)]Po = -Po[0{X), £(r)]Po (3.21) 

which holds for any operators X, Y. One can easily check Eq. (13.211) by computing matrix elements 
(^l ■ |j) of both sides for i,j G Xq. Using Eq. (13.211) and explicit values of a's and 6's one can rewrite 
HeSA as 



H, 



eff,4 



IsKVod) - \voaimnsi: 



Pn 



(3.22) 



To summarize, the Taylor series for the effective low-energy Hamiltonian can be written as 
Hcs 



HoPo + ePoVPo + yPo^i(Kd)Po + jPoVodCVd{Si)Po 



T^VodicVdYiSi) + -VodCsliVod) + TTjSKVc 



6 
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odJ 



Po + Oie' 



(3.23) 



with the simplification Eq. (I3.22p in the case when the restriction of Hq onto the low-energy subspace 
is proportional to the identity operator. 



3.3 SchriefFer- Wolff diagram technique 

In this section we develop a diagram technique for the effective low-energy Hamiltonian. By analogy 
with the Feynman-Dyson diagram technique, we shall represent the n-th order Taylor coefficient 
Hes,n as a weighted sum over certain class of admissible diagrams. In our case n-th order diagrams 
will be trees with n nodes obeying certain restrictions imposed on the node degrees. Any such 



19 



diagram represents a linear operator acting on the low-energy subspace Vq. Loosely speaking, every 
edge of the tree stands for a commutator, while every node of the tree stands for Vd or V^d- The 
weight associated with a diagram depends only on node degrees and can be easily expressed in terms 
of the Bernoulli numbers. 

Let us now proceed with formal definitions. Let T be a connected tree with n nodes. Each node 
u & T has at most one parent node and zero or more children nodes. The children of any node are 
ordered in some fixed way. We shall denote children of a node u as «(!), . . . ,u{k), where k is the 
number of children (NOC) of u. Nodes having no children are called leaves. There is a unique root 
node which has no parent node. 

For any node m G T we shall define a linear operator Ou acting on the full Hilbert space "H. The 
definition is inductive, so we shall start from the leaves and move up towards the root. 

• If M is a leaf then 0„ = 5*1 = C{Vod)- 

• \i u ^ root has k >2 children and k is even then Ou = COu[i) ■ ■ ■ 0„(fc)(K)d)- 

• If M 7^ root has exactly one child then 0„ = —CVd{Ou{i))- 

• If u = root has k children with odd k then Ou = Po Ou{i) ■ ■ ■ Ou{k){Vod)Po- 

• In all remaining cases Ou = 0. 

Recall that O = ado describes the adjoint action of O. Define an operator 0{T) associated with T 
as 0{T) = Or, where r is the root of T. It is clear from the above definition that 0(T) is an operator 
acting on the low-energy subspace Vo and 0{T) has degree n in the perturbation V. Furthermore, 
0{T) = unless the root of T has odd NOC and every node different from the root with more than 
one child has even NOC. 

Definition 3.2. A connected tree T is called admissible iff' 

• The root ofT has odd NOC, 

• Every node different from the root has either even NOC or N0C=1. 

A complete list of admissible trees (modulo isomorphisms) with n = 3, 4, 5, 6 nodes is shown on 
Fig. [21 To illustrate the definition of operators 0{T) let us consider fourth-order diagrams. Let 
Ti, T2, T3 be the admissible trees with four nodes shown on Fig. [2] (listed from the left to the right). 
Then one has 



0(Ti) 
0{T,) 



-PoVodiCVdfiSi)Po, 

—PoVod^SliVod)Po, 

PoS!iVod)Po. 



(3.24) 
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Figure 2: A set of admissible trees T(n) with n = 3, 4, 5, 6 nodes. Only one representative for each 
class of isomorphic trees is shown. The total number of admissible trees (counting isomorphic trees) 
is |T(n)| = 1, 3, 7, 20 for n = 3, 4, 5, 6 respectively, 
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For any tree T define a weight 

w{T) = Y[w{u), (3.25) 
where the product is over all nodes of T and 



' M = root has k children for some odd k, 

otherwise. 



Recall that 



a2n = ,^ X, and ftsn-i 



(2n)! (2n)! 

where are the Bernoulli numbers, see also Tabled! The main result of this section is the following 
lemma. 

Lemma 3.2. The Taylor series for Hcs can be written as 

oo 

H,s = HoPo + ePoVPo + Yl Yl ^^w{T)0{T), (3.27) 

n=2 Tgr(n) 

where T{n) is the set of all admissible trees with n nodes. 

Proof. Let us first develop a diagram technique for the generator S. Let T be any connected tree. 
For any node u & T we shall define a linear operator O'^ acting on the full Hilbert space "H. The 
definition is inductive, so we shall start from the leaves and move up towards the root. 

• If M is a leaf then O'^ = Si = C{Vod)- 

• If M has exactly one child then O'^ = —CVd{0'^(^^-j). 

• If M has k children and k is even then O'^ = C 0'^(^^ " " " ^u(fc)(Kd)- 

• In all remaining cases O'^ = 0. 

Define an operator 0'{T) associated with T as 0'{T) = O'^, where r is the root of T. It is clear 
from the above definition that 0'{T) is a block-off-diagonal operator and 0'{T) has degree n in the 
perturbation V. Furthermore, 0'{T) = unless all nodes of T with more than one child have even 
NOC. 

Definition 3.3. A tree T is called S -admissible iff any node ofT with more than one child has even 
number of children. 
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Define a weight of a tree T as 



w 



(T) = n ^\^) 



(3.28) 



wliere tlie product is over all nodes of T and 



w'{u) 



1 if u has exactly one child, 
ttfc if u has k children for some even k, 
otherwise. 



(3.29) 



Lemma 3.3. The generator of the SW transformation can he written as 



oo 



e" w'{T) 0\T). 



(3.30) 



n=l TeT'in) 

where T'{n) is the set of all S- admissible trees with n nodes. 

Proof. We can use induction in n to show that Sn is the sum of w'{T)0'{T) over all S'-admissible 
trees with n nodes. The base of induction is n = 1 in which case there is only one S'-admissible 
tree T (a single node) and 0'{T) = Si. To prove the induction hypothesis for general n we can use 
Eq. (13. lip and Eq. f l3.12p . Repeatedly expanding S^^ in Eq. f l3.1ip one gets a sum over S'-admissible 
trees, where the operators O'^ correspond to S'„- encountered in the course of the expansion. □ 

Substituting the Taylor series Eq. f l3.30p into Eq. fl3.13p one arrives at Eq. fl3.27p . □ 
3.4 Convergence radius 

In this section we analyze convergence of the formal series S = J^JLi ^"^^ = J2'jLi ^esj ■ 
Lemma 3.4. The series for S and converge absolutely in the disk \e\ < pc where 



Here ec = A/(2||y||) and |Io| is the width of the interval To. 

Proof. Define projectors Qo = I — Pq and P = I — P. Simple algebra shows that 

Rp^Rp = I + Z, where Z = -2{PqQ + QqP). 

For any p > define a disk D{p) = {e G C : |e| < p}. It is well-known that P = P{e) has analytic 
continuation to -D(ec). Indeed, choose a contour 7 in the complex plane as the boundary of the 
(A/2)-neighborhood of the interval Xq, see Fig. [31 Then any eigenvalue of Hq has distance at least 



pc 




23 



y 



Figure 3: The contour 7 (dashed black curve) encirchng the interval Xq (bold red). Eigenvalues of 
Hq are indicates by solid dots. 

A/2 from 7 and \\{zl — Ho)^'^\\ < 2/A for all z G 7. It implies that for any 2; G 7 the resolvent 
[zl — H)^^ is analytic in -D(ec). Thus we can define 



P= — [ dzizi -Ho- eVy^ (3.3V 



and the Taylor series P = Pq + Yl^=iPj^^ converges absolutely in the disk -D(ec). Furthermore, 
one can easily check that P is a projector, P^ = P, for all e G -D(ec). Note however that P is not 
hermitian for complex e and thus one may have ||P|| > 1 (clearly, ||P|| > 1 for any non-zero projector 
P since {ijj\P\il!) = 1 for some state ip)- 

Using the standard perturbative expansion of the resolvent in Eq. (I3.3ip one gets a Taylor series 

00 „ 
P = Po + 5^P,■e^ = — dz{zI-Ho)-'{V{zI-Ho)-'Y . 

j=l '^^^ 

Noting that the contour 7 has length I7I = vrA + 2|Xo|, we can bound the norm of Pj for j > 1 as 
It follows that 

iiwii = iiPo - PoP\\ < Yl WPoP^^'W <Y.\\p^''\\^[^ + a)- < i 

i=i i=i ^ ^ / |e| 

where the last inequality holds for |e| < pc- The same bound applies to ||Qo-P||- K follows that Z is 
analytic in the disk D{p^) and \\Z\\ < 2(||Pog|| + ||Qo^||) < 1- Thus the series for S = \og{U) = 
(1/2) log(/ + Z) converges absolutely in the disk |e| < pc- □ 

3.5 Additivity of the effective Hamiltonian 

Consider a bipartite system of Alice and Bob with a Hilbert space 7/ = -H^®7/^. Assume that both 
Hq and V can be represented as a sum of local terms acting non-trivially only on Alice or Bob, 

Ho = ® + ® and V = ® + ® V^. 
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In this section we choose the low-energy subspaces Vq C "H^ and Vq C 'H^ as the ground state 
subspaces of Ahce's and Bob's Hamihonians Hq and respectively. Let C %^ and C "H^ 
be the ground state subspaces of the perturbed Alice's and Bob's Hamiltonians Hq + eV^ and 
Hq + tV^ respectively. Let U"^ and be the local SW-transformations on Alice's and Bob's 
subsystems such that 

U^P\U^)^ = P^ and t/^P^(t/^)t = Pg^. (3.32) 

Applying the SW-transformation independently to Alice's and Bob's subsystems one obtains a pair 
of effective Hamiltonians 

H^^ = P,^U\H^ + eV^){U^)^P,^ and H^^ = P^U^ {H^ + eV^){U^)^ P^ . 

Since there are no interactions between the two subsystems, the combined low-energy effective Hamil- 
tonian can be defined as P^ ® + ® P^. 

On the other hand, one can consider the global SW-transformation U^^ acting the full Hilbert 
space P = P^ ® Ti^ and mapping the ground subspace of Pq + eV to the ground subspace of Pq 
such that 

IjAB^pA ^ pB^^jjASy ^ pA ^ pB (3 33) 

It generates the effective low-energy Hamiltonian 

Hti" = (Po ® Po'^W^'^iHo + eV){U^^)\P,^ ® P^). 

As was pointed out in Section [Z!4l in general U^^ ^ ® U^. Nevertheless it turns out that the 
effective low-energy Hamiltonians obtained 'globally' and 'locally' are the same. We shall refer to this 
property as additivity of the effective Hamiltonian. The additivity will play the key role in Section H] 
in which we prove that the effective low-energy Hamiltonians describing weakly interacting spin 
systems inherit locality properties of the original Hamiltonian (the so called linked cluster property). 

Lemma 3.5 (Additivity). Let H^, P^, and be the low-energy effective Hamiltonians defined 
above. Then 

Hti" = ^eff ® + ® ^eff- (3.34) 

Proof. The additivity of Peg follows directly from the weak multiplicativity of the direct rotation, 
see Lemma [M Indeed, using Eq. fl233|l . Eq. (^M>, and Eq. (K33^ one gets 

(Po^ ® Po^)P^^ = (Pq^ ® Po'')(P^ ® U"") and (P^'')HPo' ® Po"") = (P^ ® P'')^(Po^ ® Po"") 
which implies Eq. fl3.34p . □ 

It is worth mentioning that additivity of does not apply to individual diagrams in its 

perturbative expansion, see Section 13.31 The simplest counter-example is a system of two qutrits 
(spins S = 1), that is, P^ = P^ = C^. Choosing a basis of as {|0), |1), |2)}, we can define 
Hamiltonians = = |2)(2| such that = = is spanned by |0) and jl). Choosing 
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and as generic (random) 3x3 hermitian matrices we computed numerically individual diagrams 
0(Ti), 0(T2), OiTs) that contribute to the fourth-order effective Hamiltonian H^^^^^, see Eqs. fl3.24p . 
Note that 0(Ti) are two-qubit operators acting on Vq^Vq = C^(X)C^. We observed numerically that 
(0, 0|O(Tj)|l, 1) 7^ for all i = 1,2,3. It shows that individual diagrams 0{T.j) contain interactions 
between A and B. 

4 Schrieffer- Wolff theory for quantum many-body systems 

In this section we specialize the SW method to many-body systems such as spin chains and spin 
lattices. The Hilbert space describing these systems has a a tensor product structure T-L = <S>a=i 
where Ha is a local Hilbert space describing the a-th spin and is the total number of spins. We 
shall assume that the unperturbed Hamiltonian Hq is a sum of single-spin operators, so that its 
low-energy subspace Vq by itself has a tensor product structure, Vq = <S>a=i'^o,ci, where Po,a ^ T^a 
is the low-energy subspace of the a-th spin. A perturbation V will be chosen as a sum of two-spin 
interactions such that each spin can interact only with 0(1) other spins. Our assumption on Hq and 
V are formally stated in Section 14.11 

The main technical difficulty with applying general techniques of Section [3]to many-body systems 
such as spin lattices is that the norm ||e\^|| is typically a macroscopic quantity proportional to the 
total number of spins A^. On the other hand, the spectral gap A of the unperturbed Hamiltonian 
Hq does not grow with A^. It means that ||e^|| ~ A^ S> A and thus the SW transformation is not 
well-defined, see Lemma 13.11 By the same token, the Taylor series for the SW generator S and 
the effective Hamiltonian i^efr derived in Section 13.21 could be divergent. We shall get around this 
difficulty by considering the Taylor series H^s = J2q>o Hcs,q truncated at some fixed order ra. 



Note that is a polynomial in e of finite degree, so it is well defined even if the series for H^q is 
divergent. 

Since the low-energy subspace Vq has a tensor product structure, one can ask whether the trun- 
cated Hamiltonian h'^ can be written as a sum of local interactions. In Section 1431 we use additivity 
of ifeff to show that a Taylor coefficient -ffes.q includes only interactions among subsets of spins that 
are spanned by connected clusters of q edges in the interaction graph, see Theorem O It demonstrates 
that the SW transformation maps a high-energy Hamiltonian with local interactions to an effective 
low-energy Hamiltonian with (approximately) local interactions. 

Our main result is the following theorem. 

Theorem 1. There exists a constant threshold Cc > depending only on n, A, and the maximum 
degree of the interaction graph such that for all \e\ < ec the ground state energy of h'^ approximates 
the ground state energy of Hq + eV with an error at most (5„ = 0{N\e\^^^). 



n 




(4.1) 



q=2 
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We begin by proving the theorem for the special case of so-called block-diagonal perturbations, 
see Section 14.21 These are perturbations composed of local interactions preserving the low-energy 
subspace Vq. For block- diagonal perturbations the SW transformation is trivial, that is, S" = 0, 
see Section 13. 2[ It follows that i^es = PqHq + ePoVPQ. Clearly, i^es can reproduce ground state 
properties of the exact Hamiltonian Hq + eV only if at least one ground state of Hq + eV belongs 
to the low-energy subspace Vq. We prove that this is indeed the case if |e| < for some constant 
> that depends only on the gap A, maximum degree of the interaction graph, and the strength 
of interactions in V , see Lemma 14. 1[ 

The role of the SW transformation is to map generic perturbations to block-diagonal perturba- 
tions. Unfortunately, the SW transformation cannot be used directly since the transformed truncated 
Hamiltonian U{HQ + eV)U'^ becomes local only when restricted to Vq, while Lemma Wl\ requires local- 
ity on the entire Hilbert space. In addition, the Taylor coefficients Sp of the SW generator are highly 
non-local operators which complicates an analysis of the error obtained by truncating the series at 
some finite order. We resolve this problem by employing an auxiliary perturbative expansion due to 
Datta et al [2] which we refer to as a local Schrieffer- Wolff tiansfoTmation, see Section It defines 
an effective low-energy Hamiltonian 

^cfr,ioc = Po e^{Ho + eV)e-^Po 

for some unitary transformation e"^. This transformation is constructed such that the transformed 
Hamiltonian e^{HQ + eV)e~^ is block- diagonal, that is, it can be written as a sum of (approximately) 
local interactions preserving Vo- Here the locality of interactions holds on the full Hilbert space 
which allows us to apply Lemma 14.11 In Section 14.41 we prove an analogue of Theorem [1] for the 
n-th order effective Hamiltonian constructed using the local SW method. The proof uses some of 
techniques developed by Datta et al [2] to bound the error resulting from truncating the series for 
the generator T. 

Finally, we construct a unitary transformation acting on the low-energy subspace Pq that 
maps the effective low-energy Hamiltonians obtained using the standard and the local SW methods 
to each other, that is, ifgff = e-'^ifeff.ioce"^, see Section 14. 5[ The relationship between low-energy 
theories obtained using the standard and the local SW methods is illustrated on Fig. HI We use 
weak multiplicativity of the SW transformation proved in Section 12.41 to show that the perturbative 
expansion of K contains only linked clusters. This linked cluster property of K allows us to bound 
the error resulting from truncating the series for K at the n-th order by 0(iV|e|"^^). Combining the 
two errors we obtain the desired bound on 6n in Theorem [H 

4.1 The unperturbed Hamiltonian and the perturbation 

Let A be a lattice or a graph with sites such that each site m G A is occupied by a finite-dimensional 
particle (spin) with a local Hilbert space "H^. Accordingly, the full Hilbert space is 
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We will choose the unperturbed Hamiltonian Hq as a sum of single-spin operators, 

//o = J]/^o,«, (4.2) 

where Hq u acts non-trivially only on a spin u. By performing an overall energy shift we can always 
assume that each term Hq^u is a positive semi-definite operator and the ground state energy of i?o,u 
is zero. Let Vq^u ^ 'Hu be the ground subspace of a spin u and Pq.u be the tensor product of the 
projector onto Vq^u and the identity operator on all other spins. Then the ground subspace of the 
entire Hamiltonian Hq is 

ueA 

and the projector onto Vq can be represented as 

Let Qo = I — Pq he the projector onto Vq and Qo.u = I — Pq,u- We shall assume that each term 
ifo,u has a spectral gap at least A above the ground state. Thus the full Hamiltonian Hq also has 
a spectral gap at least A. In general, the maximal norm of the local terms ||-f^o,u|| is a parameter 
independent of A. However, for the sake of simplicity we shall assume that 

WHq^uW = 0(A) for all m e A. 

It is worth mentioning that if we allow each spin to have two or more distinct levels in the low- 
energy subspace Vo_u, the full Hamiltonian might not have a spectral gap between the subspaces Vq 
and Consider as an example the case when each spin has 3 basis states |0), |1), |2) which have 
energy 0, 5 > 0, and A ^ 5 respectively such that the low-energy subspace Vq^u is spanned by the 
states |0) and |1). Then the state e Vq has energy N6 while the state |2,0®(^-^)) e Vq has 

energy A. Thus for any constant S and A the spectral gap between Vq and closes if > AS~^. 

We will choose the perturbation as a sum of two-spin interactions between nearest neighbor 
sites, 

(u,v)es 

Here £ is the set of edges of the underlying lattice (which could be an arbitrary graph) and Vu,v is an 
arbitrary hermitian operator acting on a pair of sites u,v. To describe the effective low-energy theory 
we will have to consider A,-/oca/ perturbations, that is, Hamiltonians with at most /c-spin interactions. 
A /c-local perturbation V can be written as 

V ^^Va, Va^O unless \A\ < k. 

ACA 
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Here Va is an arbitrary hermitian operator acting on a subset of spins A. Define a strength of the 
perturbation as 



The strength of the perturbation provides an upper bound on the combined norm of all interactions 
affecting any selected spin. If A is a regular lattice in a space of bounded dimension then J coincides 
up to a constant factor with the maximum norm of interactions, J ~ max^ II Kill- We will always 
assume that J and A are constants independent of N. Let us say that a perturbation V = X^^ca ^ 
is block- diagonal iS Va preserves the low-energy subspace Vq, or equivalently, [Va, -Po] = 0. 

4.2 Block- diagonal perturbations 

Recall that the goal of the SW transformation U is to transform the perturbed Hamiltonian Hq + eV 
into a block-diagonal form such that the transformed Hamiltonian U{Hq -\- eV)U^ preserves the low- 
energy subspace Vq. In this section we analyze the special case when the perturbed Hamiltonian 
Hq + eV is already block-diagonal, so that U = I . \n this case the effective low-energy Hamiltonian 
is simply ifeff = Po{Hq + eV)Po. Since we assumed that HqPq = 0, one has Hes = ePoVPo- A 
natural question is whether the ground state energy of H^s coincides with the ground state energy of 
Hq + eV. Clearly, this happens iff at least one ground state of Hq + eV belongs to the subspace Vq. 
In this section we prove that this is indeed the case provided that the strength of the perturbation 
eV is below certain constant threshold depending only on A and k. Intuitively it follows from the 
fact that exciting any subset of w spins increases the energy of the term Hq by at least wA, while 
the energy of the term eV can go down at most by 2w\e\J. Hence exciting spins can only increase 
the energy provided that |e| < ec ~ A/J. Unfortunately this simple argument does not tell anything 
about states that contain superpositions of different excited subsets with different w, and one may 
expect that exciting subsets of spins "in a superposition" can help to reduce the overall energy. We 
analyze this general situation in the following lemma (to simplify notations we include the factor e 
into the definition of V) . 

Lemma 4.1. Let V = XIaca k-local perturbation with strength J. Suppose that each term 

Va is block-diagonal, that is, Va preserves the subspace Vq. Assume also that 2*^+^7 < A. Then the 
subspace Vq contains at least one ground state of Hq + V. 

Let us emphasize that the lemma does not tell anything about the spectral gap of the perturbed 
Hamiltonian Hq -\- V separating the subspaces Vq and Vq. We believe that this gap can be closed 
by perturbations with a strength of order 1/N. 

Proof. Let us assume that Hq -\-V has a ground state \ip) G Vq. We will show that this assumption 
leads to a contradiction. Indeed, we can represent such a ground state j'^) as a sum over 2-^ — 1 
configurations of excited spins. 



J = max 

ueA 




(4.3) 



ACA-.ABu 




(4.4) 



xe{o,i} 
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where a; is a binary string of length N such that x„ = 1 indicates that a spin u is excited (Po,u l^Px) = 0) 
and Xu — indicates that a spin u is not excited (-Po,« \i^x) = li^x))- More formally, 

iV'x) = R{x) IV'), where R{x) = - a^«)-Po,« + XuQo,u)- 

We note that {ipx) may be a highly entangled state. Since we assumed l'^) e V^, the configuration 
with no excited spins does not appear in that is, 

\M = 0. 

Let us choose a configuration x* e {0, 1}^ with the largest amplitude 

ax = Mipx), 

that is, 

Oix* ^ for all X. (4.5) 

The key idea is to transform l'^) to a new state rj (which is generally mixed) by "annihilating" all 
excitations in \ipx*) while leaving all other components \ipx), x x*, untouched. We will show that 
for sufficiently small strength of V the energy of r) is smaller than the energy of j'^) thus arriving at 
a contradiction. 

For any spin u E A choose an arbitrary ground state \uju) G Po.m- Define a trace preserving 
completely positive (TPCP) map that erases the state of the spin u and replaces it by the chosen 
ground state \uju)- More formally, for any (mixed) state r describing the entire system one has 

£„(r) = ® Tr„(r). 

Note that Su{t) may be mixed even if r is a pure state. The map can be regarded as an 
"annihilation superoperator" . Obviously, annihilation superoperators on different spins commute 
with each other. Let p be a state obtained from \ipx*) by applying the annihilation superoperator to 
every excited spin, that is, 

I n ^« iiMiM)- 

\u:x*=l J 

Clearly, reduces the energy of the term Hq ^ at least by Aa^* for any excited spin u. Thus if we 
denote the number of excited spins (the Hamming weight of the string x*) by w then 

Tr(pHo) < {i^xAHo\i^:,*) -wAax*. (4.6) 

Now consider a state obtained from \ip) be removing the largest-amplitude term, 

IV'else) = XI 1^^^- 
x^x* 
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Denote 

= P + |?/'else)(^else|- (4.7) 

Since are trace preserving maps, we have Trp = ('i/'a;* iV'x*) and thus Trr^ = {^\^) = 1. Let us 
show that for sufficiently small strength of V the energy of r) is smaller than the energy of {ip), thus 
getting a contradiction. Indeed, since Hq cannot change a configuration of excited spins we have 



see Eq. (14.61) . It remains to bound the contribution to the energy coming from the perturbation V. 
Simple algebra leads to 



(4.9) 



ACA 



Note that the states p and \ipx*) have the same reduced density matrices on any subset A C A that 
contains no excited spins, that is, x* = for all u & A. It follows from the fact that annihilation 
superoperators applied outside A cannot change the reduced state of A. Therefore 



Tr (pVa) - (V'x* iKilV'.-) = if x: = for all u E A. 



(4.10) 



Taking into account that any interaction Va is block- diagonal, the operator Va can change a config- 
uration of excited spins only if at least one spin in A is already excited. It means that 



{i'xAVA\'4'else) 

Using Eq. fl4.10p we get a bound 



if X* = for all u e A. 



(4.11) 



ACA 



< Yl $^|Tr(pK4)-(^x*|^A|^x*)| <2u;a,.J. (4.12) 

u : X* =1 A^u 



It bounds the contribution to the energy from the first and the second terms in Eq. (14. 9p . Now let 
us bound the contribution from the last term in Eq. (14. 9p . Using Eq. (14. lip we get 



J2'^M{MVA\i^else)) 



ACA 



m;xJ = 1 A3u xj^x* 

5^ j^^'^^'wvaWVc 



< 



u:x*=\ A^u 



< 2 ■2'' wax' J. 



(4.13) 



Here the first line follows from Eq. (14. lip and definition of \4'eise)- The second line follows from the 
Cauchy-Schwarz inequality and the fact that Va acts non-trivially on at most k spins, so the number 
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of possible transitions x* — )■ x is upper bounded by 2*^. The third hne follows from the maximality 
condition Eq. (14. 5p . Combining bounds Eq. fl4.12f4.13p we arrive at 

|Tr {r]V) - {ij\V\ij)\ < 2'^+W^. J. (4.14) 

Combining it with Eq. (14. Sp we finally get 

Tr {7]H) - {iIj\H\iIj) < (-A + 2^+^J)wa^*. (4.15) 

Since by assumption x* contains at least one excitation, we have w > 0. It means that the energy 
of r] is strictly smaller than the one of lip) if 2^^"^ J < A. However this is impossible since lip) is a 
ground state. Thus our assumption {tp) G leads to a contradiction. □ 

Lemma 14.11 can be easily generalized to the case when V contains interactions among arbitrarily 
large number of spins, but the magnitude of fc-spin interactions decays exponentially with k. 

Corollary 4.1. Consider a Hamiltonian V = Y12=i ^ such that Va is a k{a)-local perturbation with 
strength J{a). Suppose that all interactions in each perturbation Va are block- diagonal. Assume also 
that 

n 

^2^W+2 J(a) < A. (4.16) 

a=l 

Then the subspace Vo contains at least one ground state of Hq + V. 
4.3 Linked cluster theorem 

Let e^^v be formal variables associated with the edges of the lattice {u, v) G S. Consider a perturbation 

Using the perturbative expansion described in Section l3^ one can represent the effective low-energy 
Hamiltonian Hes acting on Vo as a formal operator- valued multivariate series in variables eu,v Clearly 
any Taylor coefficient Hes,n is a homogeneous multivariate polynomial in eu^v of degree n. One can 
uniquely represent Hes,n as a sum of monomials 

{u,v)ec {u,v)ec 

where K is some operator acting on Vo and C C £ is a subset of edges of the lattice. We shall refer 
to C as the support of the above monomial. Thus we have 

H,s,n = J2^r.,c, (4.17) 

cce 
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where Kn,c is defined as the sum of all monomials in Hes that have support C and degree n. Given 
any set of edges C <^ £ let A(C) C A be the corresponding set of sites, such that u e A(C) iff C 
contains an edge incident to u. 

Let us point out that the only purpose of introducing the variables e^^v is to define the operators 
Kn,c and the decomposition Eq. fl4.17p . Once this decomposition is defined, we can set e„_„ = e for 
all edges. Also note that we do not make any assumptions about convergence of H^s considered as 
a multivariate serie^ since we only consider some fixed order n. The main result of this section is 
the following theorem. 

Theorem 2 (linked cluster theorem). The operator Kn,c' ocfo only on the subset of spins A(C). 
In addition, Kn,c = unless C is a connected subset of edges and \C\ < n. 

Thus the n-th order effective Hamiltonian includes only interactions among "linked clusters" of spins 
of size at most n. However let us point out that the theorem does not say that K^^c acts non-trivially 
on all spins in A(C). 

Proof. Choose any subset of edges C £ and set eu,v = for all (m, v) ^ C. Partition the lattice 
as A = AB where A = A(C) and B is the complement of A. Then we have V = ^ for some 
operator acting on A. Since the unperturbed Hamiltonian Hq does not include any interactions, 
additivity of the effective Hamiltonian implies Hes,n = I^, where ^ is the effective low- 

energy Hamiltonian computed using only the subsystem A with the perturbation V^, see Lemma [23] 
in Section 1X51 We conclude that all monomials in the decomposition of Hes,n act trivially on B. By 
definition, Kn^c does not depend on the variables e^^^ with {u, v) ^ C, so we infer that i^n,c acts 
trivially on B. 

Suppose now that C consists of two disconnected subsets of edges Ci and C2. Choose A = A(Ci) 
and B = A{C2). Again we set e^.,^ = for all (m, v) ^ C. Then we have V = ® + I"^ ® V^, 
where ® is obtained from V by setting e^^v = for all {u,v) G C2. Similarly, ® is 
obtained from V by setting eu^v = for all [u, v) E Ci. Additivity of the effective Hamiltonian implies 
ifeff.n = -f^rfF,„ ® + ® H^ ,^, where and H^ ,^ are the effective Hamiltonians computed 

using only the subsystems A and B with perturbations and respectively. It means that no 
monomial in -ffeff.n contains variables e„ t, from both Ci and C2 simultaneously. Since Kn^c does not 
depend on the variables e„^^ with {u,v) ^ C, we infer that Kn^c = 0. 

The statement K^^c = for |C| > n follows from the fact that i/eff,n contains only monomials of 
degree n. □ 

It is obvious from the proof that the linked cluster theorem applies to any operator-valued mul- 
tivariate series that obeys the additivity property. 

■^Let us remark that absolute convergence of the univariate series iJoff — ^cff.ne" does not imply absolute 

convergence of the corresponding multivariate series, since different terms in the multivariate series may cancel out 
each other when combined into a single term iJoff.n- 
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4.4 Local Schrieffer- Wolff trasformation 

Applying the SW transformation described in Section [3] to many-body systems one must keep in 
mind that its generator S" is a highly non-local operator even in the lowest orders of the perturbative 
expansion. This follows from the fact that S" is a block-off-diagonal operator, that is PqSPq = and 
QqSQq = 0, see Lemma I2.3[ One can easily check that any block-off-diagonal operator must act 
non-trivially on every spin in the system. Indeed, block-off-diagonality implies S = PqS + SPq. If 
one assumes that S acts trivially on some spin u, then Qo^uSQo,u = SQo^u 0. On the other hand, 
Qo,uSQo,u = Qo,u{PoS + SPo)Qo,u = since Qo,uPo = PoQo,u = 0. The non-locality of S is not a 
very serious drawback if the main object one is interested in is the effective low-energy Hamiltonian 
i^eff- Indeed, the linked cluster theorem proved in the previous section implies that all low-order 
corrections to -ffeff are in fact local operators. On the other hand, the non-locality of S makes it 
more difficult to analyze properties of -ffeff- In this section we get around this difficulty by employing 
a local version of the SW transformation developed by Datta et al [2j. To avoid a confusion between 
the two types of the SW transformation we shall refer to the direct rotation described in Sections [2|3] 
global SW transformation. 
The local SW transformation is a unitary operator U that brings the perturbed Hamiltonian 
Ho + eV into an approximately block-diagonal form. Specifically, for any fixed integer n > we shall 
choose U = where T is a degree-n polynomial in e, 

n 
q=l 

This polynomial will be chosen such that the transformed Hamiltonian e^(ifo + ^V)e~'^ is block- 
diagonal if truncated at the order n. The definition of T is similar to the perturbative definition of 
5", see Section 13.21 We use the Taylor expansion of exp (T) to get 

n oo 
j=l j=n+l 

where V^^^ is a certain polynomial function of Ti, . . . , Tmm{j,n) applied to Hq or V. The precise form 
of this polynomial will be given in Eq. (I4.27p . Strictly speaking, V^^^ should also carry a label n, but 
since n is fixed throughout this section, we shall only retain the label j. 

The main difference between the local and global SW method is the inductive rule used to 
define the generators of the transformation. In the global SW method the generators Sj are fixed 
by demanding that Sj is block-off-diagonal while the j-th order contribution to the transformed 
Hamiltonian is block-diagonal, see Section 13. 2[ In the local SW method the generators Tj are fixed 
by demanding that Tj is a sum of local interactions (the range of interactions may depend on j) and 
each interaction is locally block-off-diagonal on the subset of spins it acts upon. For each local term 
in the expansion of V^^~^'' we shall introduce the corresponding local term in Tj chosen such that the 
expansion of [Tj,!!^] + V^^~^^ contains only block- diagonal terms. 
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To define the generators Tj formally let us introduce some notations. For any subset of spins 
A C A define projectors onto the local low-energy and high-energy subspaces 

Pa = YIPou and Qa = I-Pa. (4.19) 

Introduce superoperators Oa and Va such that 

Oa{X) = PaXQa + QaXPa and Va{X) = PaX Pa + QaX Qa- 

Clearly, X = Oa{X) + Va{X) for any operator X. We shall also need a local version of the 
superoperator £, see Eq. (13.61) . It can be chosen as 



/•oo roo 

Ca{X)= dte-'^'^-^QAXPA- dtPAXQAC 
Jo Jo 



tHo,A 



where Hqa = '^ueA^o,u- Taking into account that ||e ^^°'^Qa\\ < e '^^ one can easily check that 
the integral over t converges and 

II^^WII<^- (4.20) 
In addition, for any operator X that acts non-trivially only on spins in A one has an identity 

Ca{[Ho,X]) = [Ho,Ca{X)] = Oa{X). (4.21) 

Now we are ready to define the generators Tj. We begin by representing V^^^ as a sum of local 
interactions, 

ACA 

Here vj^'^ acts non-trivially only on A (for example, one can use the expansion of V^^^ in the gener- 
alized Pauli basis). Then we choose 

T,- = £a(v1^"^^) for all I < j < n. (4.22) 

ACA 

Using the identity Eq. fl4.2ip one can easily check that for this choice of Tj 

[Tj, Ho] + V^^-^^ = J2 ^^(Ki^^^^) for all 1 < J < n. 

ACA 

Accordingly, the transformed Hamiltonian can be written as 

e^{Ho + eV^)e-^ = H^^^ + //garbage, (4.23) 



35 



where 

n 

=H, + Y,^^Y. ^^(^i'"'^) (4-24) 
is a sum of local block-diagonal interactions, while 

oo 

i^garbage = V^^'^^ (4.25) 

j=n+l 

includes all unwanted terms generated at the order n + 1 and higher. Finally, we define the n-th 
order effective low-energy Hamiltonian as a restriction of if^"^ onto the subspace Pq, 

i/i^ = Po^^"^Po. (4.26) 

It is obvious from this construction that h'^ is additive for perturbations describing two non- 
interacting disjoint systems, see Lemma [3.51 It follows that the Taylor coefficients of h'^ obey the 
linked cluster property of Theorem [21 

For the later use let us give an explicit formula for the polynomial V^^\ We have V"*^*^-* = V and 

9=2 ' l<j\,...jq<n q=l ' l<ji,---jq<n 

jl + ---+jq=j jl + ---+jq=j-i 

for all J > 2. Note that convergence of all above series in the limit n — )■ oo does not matter since 
we always consider some fixed order n. However, it can be shown [2] that the — )■ oo limits of the 
series for T and Pjg^ converge absolutely in the disk of radius ec ~ l/N"^. 

Below we shall prove that the norm of -^garbage can be bounded by 0(A^|e|"'''^) for sufficiently 
small e. Then we shall employ Lemma 14.11 and its Corollary 14.11 to show that the ground state of 
iJ^"'* belongs to the subspace Vq for sufficiently small e. Combining the two results together we shall 
get the following theorem which can be regarded as an analogue of Theorem 4.7 in |2]. 

Theorem 3. Let h'^ he the n-th order low- energy effective Hamiltonian constructed using the local 
SW method. There exists a constant threshold 

ec = n{An-^) (4.28) 

such that for all \e\ < ec the ground state energy of h'^ approximates the ground state energy of 
Hq + eV with an error 6n = 0{N\e 



\n+l\ 



Given any perturbation V represented as a sum of local interactions, V = ^^ca ^ shall use 
the notation ||V^||i for the strength of V, that is, 

II V"||i = max } \\Va\\- 
ueA ^ — ^ 

A3u 
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All derivations below will implicitly use a bound 

J2 ll^^ll ^ l^^l ■ ll^lli (4-29) 

that holds for any operator V = J2aca ^ subset of spins MCA. In particular, choosing 

M = A we get a bound 

\\V\\<J2\\^^\\^^-\\ni- (4.30) 

ACA 

Applying Eq. f l4.30p we can bound the norm of -^garbage as 

oo 

ll^^garbagell = ||e^(i/o + eV^)e-^ II < AT ^ |ep.||vO-i)||,. (4.31) 

j=n+l 

Thus our task is to get an upper bound on the norm ||l^^-'-'||i. To simplify notations we shall assume 
that the original perturbation V has strength ||V^||i = 1. 

Lemma 4.2. The operators Tj and V^^~^^ are (j + l)-local Hamiltonians. There exists constants 
a, /3 > such that 

ll^^^'^l|i<«(^)' (4.32) 

for all j > 0. 

Let us first explain how Theorem [3] follows from Lemma [4.21 Define pc = A/3n~^. Assume for 
simplicity that e > 0. Substituting Eq. f l4.32p into Eq. fl4.3ip one gets 

oo 

lli^garbagell < aA^ ^ e^^^' < 2aP ANn'^e / p^T^' (4.33) 

i=n+l 

provided that e < Pc/2. Using Corollary 14.11 of Lemma [4.11 and the fact that V^^~^'' is (j + l)-local 
Hamiltonian we conclude that the ground state of H^"''^ belongs to the subspace Vq whenever 

n 

^23+^'.|ep.||\/(^'-i)||i< A. 
i=i 

Using the bound Eq. fl4.32p we get 

n oo 

^2^+^' . \e\^ ■ ||l^(^'"^^||i < 16ae^(2e/p,)^' < 32ae < A 

j=l j=0 
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provided that e < pc/4 and 32ae < A. If we choose 



min 



4^' 32^ 



then for all e < ec the ground state of if^"^ belongs to the subspace Vq and ||-ffgarbagc|| = 0(A^e"+^). 
Theorem [3] now follows from the fact that iJ^"^ + -^garbage has the same spectrum as Hq + V. In the 
rest of the section we prove Lemma I4.2[ 

Proof of Lemma 14. 2i Let us first analyze locality of Tj and V^^~^\ 

Proposition 4.1. Tj is a (j + l)-local anti-hermitian Hamiltonian. V^^^^'^ is a {j + l)-local Hamil- 
tonian. 

Proof. We shall use the fact that for any a-local Hamiltonian X and 6-local Hamiltonian Y the 
commutator [X, Y] is at most (a + 6 — l)-local. By assumption, V^^^ = is a 2-local Hamiltonian. 
Using Eq. fl4.22p we conclude that Ti is a 2-local Hamiltonian (anti-hermitian). Let us use induction 
to prove that Tj and V^^~^^ are (j -|- l)-local. Indeed, from Eq. fl4.27p one can see that l/^-'-^) IS a 
sum of terms proportional to Tj^ ■ ■ - Tj^^Ho) with ji + ■ ■ ■ + jq = j, > 2, and terms proportional 
to Tj^ ■ ■ -Tj^iy) with ii + . . . + iq = j — 1, q > 1. The terms of the first type are at most k'- 
local with k' = ji + . . . + jq + 1 = j + 1. The terms of the second type are at most fc"- local with 
k" = 2 + ji + . . . + jq = j + 1. Thus V^^-""^ is j + 1-local. Using Eq. (KT2\i we conclude that Tj is 
j + 1-local. □ 

Recall that the truncated Hamiltonian if^"^ is computed using a finite number of generators 
Ti, . . . , T„. Thus any generator Tj is at most {n + l)-local. 

Proposition 4.2. Let W be an r-local Hamiltonian. Then for any j < n one has 

\\fj{W)\\, < +2 + \\V^^~'Hi ■ \\W\\i- (4.34) 
Proof. Indeed, expand Tj and W as a sum of local operators, 

MCA ACA 

where \M\ < j + 1 < n + 1, see Proposition 14. H and \A\ < r since W is r-local. Here Tj is related 
to V^(J-i) by Eq. (KT2^ . that is, 

MCA 
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Let M G A be the spin that maximizes the norm ||rj(14^)||i. Then 

MBu A:AnM^0 A3U M : AfnA^0 

< liiw^iii E 1^1 • ii^M^'^ii + |ii^^^'"^^iiiEi^i ■ ii^^ii 

M3u Abu 



A\ 



^ 2(r + n + l 



□ 

Now we can apply Proposition 14.21 to bound the norm of any term in Eq. fl4.27p . Note that 
ig composed of two types of terms. We have terms proportional to = Tj^ ■ ■ ■ T^^ (i^o) 
with 1 < ji, . . . , jq < n, ji + . . . + jq = j, and terms proportional to Bj = Tj^ ■ ■ ■ Tj^iV) with 
^^ji,---,jq^n, ji + ...+jq=j — 1. Note that applying q commutators Tj^ ■ ■ ■ Tj^ to Hq or V we 
can produce at most 0(g?7,)-local operator. Applying Proposition 14.21 we get 

ll^jlli < 0{nA-yq''-A-\\V^^'-^'>\\i---\\V^^''-^^\\i, 

< 0{nA-yq''-\\V^^'-^^\\i---\\V'^^''-^^\\i (4.35) 

where we have taken into account that ||V^||i = 1 and ||i^o||i = 0{A). Let us introduce an auxiliary 
quantity 

X, = ll^^'^'^lli, J>1- (4.36) 
Using Eqs. (I4.27f4.35p we can get an upper bound Xj ^ /^i? where 

AH = Xi = ll^lli = 1 

and /ij, j > 2, are defined inductively from 

j i-i 
/ij = A E E ■ ■ ■ /^i, + E E ■ ■ ■ ^J'^ ' ^^•2'^) 

q=2 ii+.-.+jq^j q=l ji + ---+jq=j-l 

where c = 0{nA^^). Here we have taken into account that q'^/ql = 0(1)'^ and omitted the constraint 
ji, ■ ■ ■ ,jq < ^ since we are interested in an upper bound on Xj (note that adding this constraint can 
only make fij to grow more slowly with j). In order to get an upper bound on fij define an auxiliary 
Taylor series 

00 

fi{z) = J2f^j^'- (4-38) 
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Inspecting the recursive equation Eq. fl4.37p one can easily convince oneself that fi{z) has to obey 
the following equation 



Solving this equation one can easily find the inverse function: 

z{iJ,) = /i — /i^(c + Ac^). 

Clearly z{fi) is analytic in the entire complex plane. Solving the quadratic equation and choosing 
the branch for which fi{0) = one gets 



2(c + Ac2) 

Clearly fj,{z) is analytic in a disk \z\ < zq where 

1 

^° 4(c+ Ac2) - n2 

for some constant (5 > 0. Here we have taken into account that c = 0{nA~^). Using the Cauchy 
integral formula for fi{z) one gets 

which yields 

//3A^^-^■ 



for some constant a > 0. The lemma is proved. □ 



4.5 Equivalence between the local and global SW theories 

In this section we prove Theorem [H The key idea of the proof is to construct a unitary operator 
acting on the low-energy subspace Vo that transforms the n-th order effective low-energy Hamiltonian 
obtained using the local SW method to the one obtained using the global SW method with an error 
0{N\e\^^^) in the operator norm. We illustrate the relationship between the two low-energy theories 
on Fig. m Once the desired unitary transformation is constructed, Theorem [1] follows directly 
from Theorem 131 To introduce the notations let us first consider the limit n — )■ oo. Let 

oo oo 

S = ^Sje^ and T = J]T,e^' (4.39) 
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Figure 4: Correspondence between the high-energy theory Ho + eV and the effective low-energy the- 
ories obtained using the global and the local Schrieffer- Wolff transformations. The unitary operator 
defined on the low-energy subspace Vo maps the two effective low-energy Hamiltonians to each 
other. 

be generators of the SW transformation calculated using the global rule (S) and the local rule (T). 
Here we regard S and T as formal Taylor series. Let 

i/gi = e^Ho + eV)e-^ and H^°' = e^{Ho + eV)e-^ 

be the corresponding transformed Hamiltonians which we also consider as formal Taylor series. Let 
j^ioc,(n) g^j^^ ^gi,(n> ^]^g Hamiltonians obtained from H^°^ and H^^ respectively by truncating the 
series at the order n. 

Theorem 4. Let 

L = Po^'°''^"^Po and M = Pq m^'^"^ Pq 

be the effective order-n Hamiltonians on the low-energy subspace obtained by the local and the global 
SW methods. There exists a constant (3 depending only on n, A, and the maximum degree of the 
interaction graph such that 

\\M - e^Le-^\\ < /3A^|e|"+i for all \e\ < 1 (4.40) 

for some unitary operator acting on the subspace Vq. 

A natural way to define the desired unitary transformation is 

e^ = e^e-^. (4.41) 

Indeed, one should expect that e~'^ maps the local low-energy to the high-energy theory and e"^ maps 
the high-energy theory to the global low-energy theory. The main difficulty in using Eq. f l4.4ip is 
that (i) it defines a unitary transformation on the entire Hilbert space H while we expect the two 
theories to be close only on the low-energy subspace Vq; (ii) the formal series K, S, and T may be 
divergent. 
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The proof of Theorem H] goes as follows. We shall regard Eq. (14.411) as a definition of a formal 
Taylor series 

oo 

i=i 

Here Kj are some operators acting on the entire Hilbert space "H. We will show that operators Kj 
are actually block- diagonal, so we can apply the transformation on the subspace Vq only, see 
Lemma 14. 3[ even if one uses a truncated version of K. Next we shall make use of the fact that 
the SW operators and have (weak) multiplicativity properties, see Lemma 12.41 It will allow 
us to prove multiplicativity for and thus additivity of K, see Eqs. (I4.46p . Next we shall exploit 
the additivity of K to show that the series for K obeys the linked cluster condition analogous to 
Theorem |2l Finally, we shall construct the desired transformation mapping the local theory to the 
global theory as exp (i^^""^), where Ji'^"^ = YTj=i^j ■ Lemma [4.41 shows that this transformation 
indeed does the right job. 

Lemma 4.3. Suppose Hq has a positive spectral gap A between Vq and Vq. Then all generators Kj 
are block- diagonal. 

Proof. By definition we have 

iJgi = H^"" e-^ (4.42) 

and both operators H^^, H^°'^ are block-diagonal. It is assumed here that H^^ and H^°'^ are formal 
Taylor series in e Let us multiply Eq. (I4.42p by Pq and Qq on the left and on the right respectively. 
Taking into account that H^^ and are block-diagonal, and using formal Taylor series H^°'^ = 
Ej°lo wi^h Hi""" = Ho one obtains 

Po[K„ Ho]Qo + PofiK,, H'r, H'pQo = for all j > 1, (4.43) 

where f{Ki, . . . , Kj^i, . . . , Hj°'^) is some polynomial obtained from the expansion of the expo- 
nent in Eq. (I4.42p . Let us use induction in j to show that Kj is block-diagonal. Indeed, suppose we 
have already proved that Ki, . . . , Kj^i are block-diagonal. Then the term 

PofiK,,...,K,^,,H'r,...,H!^nQo 
in Eq. (I4.43P vanishes and we can rewrite Eq. (I4.43P as 

{PoK,Qo)iQoHoQo) = {PoHMiPoK.Qo). (4.44) 

If lip) G Qo is any eigenvector of QqHqQq with an eigenvalue A, then Eq. (I4.44p implies that a 
state {PqKjQq) \ip) is an eigenvector of PqHqPq with the same eigenvalue A. On the other hand, 
{PqKjQq) \ip) G Vq. It contradicts to the spectral gap assumption unless {PoKjQq) \ip) = 0. Since 
eigenvectors of QqHqQq span the entire subspace Qo, we conclude that PqKjQq = 0. Since K is 
anti-hermitian, it implies QqKjPq = 0, that is, Kj is block-diagonal. □ 
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Let us now prove that the restriction of K onto the Vq subspace obeys the additivity property: 
if one partitions the lattice A into two disjoint parts, A = AB, such that V = Va + Vb then 
K = Ka + Kb (only on the subspace Vq). Note that the generator T constructed using the local SW 
method is additive by construction, so we have T = T4 + Te on the entire Hilbert space (not only on 
the subspace Vq). On the other hand, weak multiplicativity of the direct rotation, see Lemma \'2A\ 
implies 

Since = e^e~^, we conclude that the restriction of K onto the subspace Vq is additive, that is, 

e-^ Po = e"^^-^^ Po (4.46) 

By abuse of notations let us identify K, Ka, Kb with their restrictions of the on the subspace Vq. 
Then we have = e'^^e'^'^ = e"^-^^'^^. Taking the logarithm we obtain the desired additivity 
property, K = Ka + Kb- It implies that the series for K obeys the linked cluster condition, see 
Theorem |2j Using the same notations as in the statement of Theorem |2] we conclude that 



j,c, 



cce 



where Kj^c acts only on A(C) and Kj^c = unless \C\ < j and C is a connected subset of edges. 
Define an interaction strength of K as 



Jk = max max > ||/C-^c'||. (4.47) 

uGC j=l,...,n 

C3U 

We claim that 

Jk = 0(1), (4.48) 

that is, Jk has an upper bound independent of A^. Indeed, assuming that the interaction graph has 
degree c? = 0(1), the number of linked clusters of size < n containing a given vertex u can be upper 
bounded by dP^^^ = 0(1). Additivity of K implies that we can compute Kj^c by turning off all 
interactions Vu^^, (w,f) ^ C. It leaves us with a subgraph with at most A^' = 2\C\ < 2j < 2n = 0(1) 
sites. Since the number of spins is 0(1), any operator Kj^c is a const ant- degree polynomial with 
constant-bounded coefficients, so that HA'j^c'll = 0(1). Theorem H] now follows from the following 
lemma. 

Lemma 4.4. Let K, L, M be formal operator-valued Taylor series acting on the subspace Vq such 
that K is anti-hermitian, K{0) = 0, and 

M = e^Le-^. (4.49) 
For any integer n = 0(1) define truncated series 

n n n 

k = Y,K,e^, L = Y,L,e^, M = J]M,e^- (4.50) 

j=l j=0 i=o 



43 



and a truncation error 

5=\\M-e^Le-^\\. (4.51) 

Suppose K,L,M obey the linked cluster property and have constant- bounded interaction strength, 
Jk,Jl,Jm = 0{1). Then 

6 < 0(l)A^|e|"+^ for any \e\ < 1. (4.52) 
Proof. Introduce a truncated version of a transformed operator e^Le~^, namely, 

1 . 

= ^ + E-y[^'-H^- 

The assumptions that K obeys the hnked cluster property and K{0) = imply that iK can be 
regarded as a local Hamiltonian with interaction strength upper bounded by S"=i I^P = 0(|e|). 
Similarly, L can be regarded as a local Hamiltonian with interaction strength 0(1). Using Lemmas 1,2 
from [21] one gets an approximation 

lle^L e-^ - eil < II [k, -r^'L \\ < 0{l)N\er\ (4.53) 

Therefore it suffices to show that 

||M- 0|| < 0(l)A^|e|"+\ (4.54) 
Expanding the multiple commutators in 6 we get a linear combination of terms 

i. ,.1+92+...+..+. ^g^K,, ■ ■ ■ kg^{L,), 1 < gi, . . . , < n, I < j < n, 1 < q < n. 

Let us call a term as above good if the total power of e is at most n, that is, qi + q2 + ■ ■ ■ + qj + q < n. 
Otherwise let us call such a term bad. Using the condition Eq. (14.491) one can easily check that the 
sum of all good terms in 9 equals M. Thus we get a bound 

||M - 0|| < |er+^ #{bad terms} max \\kg,kg, ■ ■■kg^{Lq)\\. (4.55) 

Clearly for a constant n the total number of terms in B is 0(1) and so the number of bad terms is 
0(1). Applying again Lemma 2 from [24] and taking into account that the Taylor coefficients Kq^ 
and Lq are local Hamiltonians with interaction strength 0(1) we get a bound 

ma.lc\\kq,kq,---kq^{Lq)\\<0{l)N. 

Therefore \\M - e|| < 0(l)A^|e|"+\ □ 
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